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MUTANT TN5 TRANSPOSASE ENZYMES AND METHOD FOR THEIR USE 

CROSS-REFERENCE TO RELATED APPLICATIONS 
This application claims the benefit of US provisional patent application number 
60/146,686, filed on August 2, 1999, which is incorporated herein by reference in its 
5 entirety. 

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH 

OR DEVELOPMENT 
This invention was made with United States government support awarded by the 
following agency: NIH, Grant No. GM50692. The United States government has 
1 0 certain rights in this invention. 

BACKGROUND OF THE INVENTION 
Bacterial transposons such as Tn5 evolved within the cell by maintaining a low 
mobility level. While necessary for the transposon to survive, the low mobility level has 
inhibited the ability of researchers to detail the molecular transposition process and to 

1 5 exploit the transposition process for use, e.g., in the development of new diagnostic and 
therapeutic resources. Tn5 is a conservative "cut and paste" transposon of the IS4 
family (Rezsohazy,R., Hallet,B., Delcour,J., and Mahillon,J, "The IS4 family of 
insertion sequences: evidence for a conserved transposase motif," Mol Microbiol. 
9:1283-1295 (1993)) that encodes a 53kD transposase protein (Tnp) that is responsible 

20 for its movement. The wild-type Tn5 transposase amino acid and nucleic acid 

sequences are known. Ahmed, A. and Podemski, L. The Revised Sequence of Tn5. 
Gene 154(1), 129- 13 0(1 995), incorporated by reference as if set forth herein in its 
entirety. A nucleic acid sequence that encodes wild-type Tn5 transposase is attached as 
SEQ ID NO:l. A polypeptide sequence encoded by SEQ ID NO:l which corresponds 

25 to wild-type Tn5 transposase is attached as SEQ ID NO:2. 

The Tnp protein facilitates movement of the entire element by binding initially 
to each of two 19bp specific binding sequences termed outside end (OE; SEQ ID NO:3), 
followed by formation of a nucleoprotein structure termed a synapse, blunt ended 
cleavage of each end, association with a target DNA, and then strand transfer 

30 (Reznikoff,W.S., Bhasin Davies,D.R., Goryshin,I.Y., Mahnke,L.A., Naumann,T., 
Rayment,!., Steiniger-White,M., and Twining,S.S., "Tn5: A molecular window on 



-1- 



WO 01/09363 



PCT/US00/21052 



transposition" Biochem. Biophvs. Res. Commun. 266:729-34 (1999)). Tn5 
transposase can also promote movement of a single insertion sequence by using a 
combination of OE and inside end (IE; SEQ ID NO:4) sequences. The IE is also 19bp 
long and is identical to OE at 12 of 19 positions (Fig. 1). In v/vo, Tn5 transposase 
5 exhibits a marked preference for OE in E. coli. Transposase recognition and binding to 
IE is inhibited in R coli by the presence of two dam methylation sites (GATC 
palindromes) which add four methyl groups per inside end sequence (IE^ also depicted 
as SEQ ID NO:4, methylation not shown) (Yin,J.C.P., Krebs,M.P., and Reznikoff,W.S., 
"Effect of dam Methylation on Tn5 Transposition," J. Mol Biol.. 199:35-45 (1988), 
10 incorporated by reference as if set forth herein in its entirety). This methylation reduces 
transposition by reducing protein-DNA primary recognition (Jilk, R.A., YorkJX, and 
Reznikoff, W.S., "The organization of the outside end of transposon Tn5," J. Bacteriol. 
178:1671-1679 (1996)). 

A principal roadblock to understanding how Tn5 transposes is the fact that 
15 purified wild-type Tnp has no detectable activity in vitro. Recently, a double mutant 
hyperactive form of transposase ("Tnp EK/LP") that promotes all steps of the 
transposition reaction in vitro was developed. The Tnp EK/LP protein differs from 
wild-type Tn5 Tnp at position 54 (Glu to Lys mutation) and at position 372 (Leu to Pro 
mutation), in addition to a non-essential but advantageous change at position 56 that 
20 prevents production of a so-called inhibitor protein. The modified hyperactive Tnp 
protein retains the dramatic preference for OE (or OE-like) termini of wild-type Tn5 
transposase. Tnp EK/LP has clarified many aspects of Tn5 transposition that were not 
previously adequately addressable in vivo. 

In vitro polynucleotide transposition is a powerful tool for introducing random or 
25 targeted mutations into a genome. Useful in vitro transposition systems based upon the 
Tn5 transposon are disclosed in US Patent No. 5,925,545 and International Publication 
No. WO 00/17343, both of which are incorporated herein by reference in their entirety 
as if set forth herein. 

A Tnp protein having an ability to discriminate between IE and OE and having a 
30 preference for binding IE is desired to permit directed nucleic acid transposition and to 
facilitate more complex transposition and genetic engineering strategies of the type 
disclosed in the above-mentioned patent and application than are available using a Tnp 
having a single specificity for OE. A Tnp having an enhanced preference for IE 1 ** is 
also desired because methylation of DNA in common dam+ bacterial hosts inhibits 
35 binding of existing Tn5 transposases and reduces the ability of existing transposases to 
facilitate movement of IE-defined transposons. 
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BRIEF SUMMARY OF THE INVENTION 
The present invention is summarized in that a transposase protein modified 
relative to wild-type Tn5 Tnp as disclosed herein preferentially promotes transposition 
of a target sequence flanked with wild-type TnJ transposon inside ends (IE) rather than 
5 outside ends (OE) without regard to whether the IE sequences are methylated. 

In a related aspect, the present invention is also summarized in that a transposase 
modified relative to wild-type Tn5 Tnp as disclosed herein has a preference for IE over 
OE and is hyperactive with regard to transposition frequency. 

In another related aspect, the present invention is also summarized in that a 
1 0 transposase modified relative to wild-type Tn5 Tnp as disclosed herein has a preference 
for IE over OE and catalyzes transposition at a high level even when the IE sequences 
are methylated. In contrast, wild-type Tn5 transposase does not efficiently recognize 
methylated IE sequences. 

In yet another related aspect, the present invention is summarized in that a 
15 transposase according to the invention includes a mutation relative to wild type Tn5 
transposase that either (1) alters binding of the transposase to the DNA termini or (2) 
enhances transposition or (3) both. The mutation can be end-sequence-specific (as in the 
exemplified embodiments that alter DNA binding) or non-specific (as in the exemplified 
embodiment that enhances transposition. 
20 In still another related aspect, the present invention is summarized in that a 

transposase according to the invention has (1) a greater preference for IE than OE and 
(2) differs from wild-type Tnp in at least one an amino acid selected from the group 
consisting of amino acid 58, amino acid 344 and amino acid 372. 

In still another related aspect, the present invention is summarized in that a 
25 transposase according to the invention differs from wild-type Tnp in that it contains at 
least one of a mutation from glutamic acid to valine at amino acid 58, a mutation from 
glutamic acid to lysine at amino acid 344, and a mutation from leucine to glutamic acid 
at amino acid 372. 

In another related aspect the invention is summarized in that the transposase of 
30 the invention can also exhibit a greater preference for IE by reducing the preference for 
OE. A mutation at amino acid position 8 relative to wild-type transposase can reduce the 
preference of a transposase for OE, and thereby increase the apparent preference for IE. 
A mutation from arginine to cysteine can accomplish this modification. 

A transposase protein of the invention can promote more transposition of an IE- 
35 flanked target sequence in vivo or in vitro than wild-type Tn5 transposase does. A 

suitable method for determining transposase enzyme activity in vitro is disclosed herein 
and in US Patent No. 5,925,545, incorporated herein by reference in its entirety. A 
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suitable method for determining transposase activity in vivo is disclosed herein. 

The modified Tn5 Tnp of the present invention differs from wild-type Tn5 Tnp 
by virtue of at least one a change to an amino acid position, where the change is selected 
from the group consisting of (1) a change at amino acid position 58 that reduces or 
5 eliminates a negative interaction between the Tnp and a methylated DNA residue and 
(2) a change at amino acid position 344 that alters DNA binding. In addition to the 
changes notedherein, the modified Tnp's of the invention can also include a change at 
position 56 (such as a Met to Ala change) that prevents production of the so-called 
inhibitor protein that interferes with transposition. Moreover, the mutant Tn5 

10 transposase proteins can contain mutations in addition to those noted above. Additional 
mutations relative to the wild-type Tn5 Tnp are disclosed below. The effect of each 
mutation is disclosed below and it is understood that the applicants have identified 
amino acid residues of the protein that have a direct impact upon function and that other 
modifications at the same positions can have effects comparable to, greater than or 

1 5 lesser than those noted on the preference of the protein for inside ends. The wild-type 
Tn5 transposase amino acid sequence is presented in the Sequence Listing. Exemplified 
changes relative to that wild-type transposase are presented in the text of this 
application. 

The invention is further summarized in that the enzymes disclosed herein 
20 facilitate a simple, in vitro system and method for introducing any transposable element 
from a donor DNA into a target DNA when the transposable element DNA is flanked on 
either side by IE termini inverted relative to one another. Few other requirements on 
either the donor DNA or the target DNA are envisioned. It is thought that Tn5 has few, 
if any, preferences for insertion sites, so it is possible to use the system to introduce 

25 desired sequences at random into target DNA. Therefore, it is believed that this system 
and method, employing the modified transposase described herein and a simple donor 
DNA, is broadly applicable to introduce changes into any target DNA, without regard to 
its nucleotide sequence. It will, thus, be applied to many problems of interest to those 
skilled in the art of molecular biology. 

30 Finally, although the changes noted herein are disclosed in terms of changes at 

the protein level, it is well within the ability of a skilled artisan to modify a 
polynucleotide that encodes a Tn5 transposase protein to encode the modified proteins 
of the invention. The skilled artisan also understands the degeneracy of the genetic code 
and understands that a plurality of codons can direct the production of a single amino 

35 acid residue. 

The invention will be more fully understood upon consideration of the following 
detailed description taken in conjunction with the accompanying drawings. 
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BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS 
Fig. 1 depicts the structure of Tni transposase and the nucleotide sequences of 
Tni outside ends (OE), inside ends (IE), methylated inside ends (IE*^), and modified IE 
(IE12A). 

5 Fig. 2 is a schematic depiction of the molecular basis for a papulation assay for 

observing transposition in vivo. 

Fig. 3 A depicts the locations of mutations observed in four successive rounds 

(A, B, C, D) of mutagenesis/recombination. Fig. 3B depicts the in vivo transposition 

profile of each of the mutants of Fig. 3 A. 
10 Fig. 4 depicts the relative preference of mutant transposases obtained in 

successive rounds of mutagenesis/recombination for OE and IE in a dam- strain of E. 

coli. 

Fig. 5 A depicts a plasmid suitable for use in an in vitro transposition method. 
Fig. 5B depicts the transposition products obtained using a mutant transposase of the 
15 invention to catalyze transposition in vitro. Fig. 5C further characterizes the products of 
lane 2 of Fig. 5B. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
Random mutagenesis studies performed on the Tnp gene have demonstrated that 
it is possible to isolate mutations in the protein that increase the rate of transposition 
20 (Krebs and Reznikoff, 1988; DeLong, A., and Syvanen, M., "Trans-acting transposase 
mutant from Tn5," PN.A.S. U.S.A. 88:6072-6 (1991); Wiegand, T.W., and Reznikoff, 
W.S., "Characterization of two hypertransposing Tn5 mutants," J. BacterioL 
174:1229-1239 (1992); Weinreich, M.D., Gasch, A., & Reznikoff, W.S., "Evidence that 
the cis preference of the Tn5 transposase is caused by nonproductive multimerization," 
25 Genes. Dev. 8: 2363-2374 (1994); Zhou, M., & Reznikoff, W.S., "Tn5 mutants that alter 
DNA binding specificity," J. Mol. Biol. 271:362-73. (1997), incorporated by reference 
as if set forth herein in its entirety). This makes transposase unique in comparison to 
most enzymatic proteins that appear to have evolved for maximum in vivo activity. 
This is likely due to the fact that high rates of transposition would be detrimental to the 
30 survival of the transposon and hence transposase has evolved to have an 'optimal' level 
that is much lower than its maximum attainable level. 

Applicants have isolated a set of related mutant transposase proteins that differ 
from wild type Tn5 transposase in that the mutant proteins show a preference in a 
transposition system for inside ends (IE), and in some cases, methylated inside ends 
35 (IE**), rather than for outside ends (OE), which are unmethylated because they lack a 
methylation site. The end preference of a mutant transposase can characterized either 
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(1) by the in vivo transposition frequency observed when it is used in a system in which 
the target polynucleotide is flanked with either OE and lE 1 ^ termini, or (2) by the ratio 
of the in vivo transposition frequencies observed when it is used in a pair of systems in 
which the target polynucleotide is flanked with OE and IE 1 ^ termini, respectively, 
5 Although applicants have exemplied a number of transposase proteins differing from 
wild-type Tn5 Tnp at one, four, five, and seven mutations, one can reasonably predict 
from their analysis the effects of particular individual mutations. 

EXAMPLE 

Overview 

10 A number of related methods were used to obtain the family of mutants disclosed 

herein. In a first method, the applicants obtained mutants that restored in vivo 
transposition activity to a mutated end binding sequence that is not recognized as a 
substrate by wild type transposase. In a second method, the applicants introduced 
directed mutations into certain products of the first method to determine the preferred 

1 5 structure of a mutant transposase according to the invention. 

In the first method, the mutant IE end binding sequence contains an adenine in 
place of thymine at position 12 ("IE12A"; SEQ ID NO:5). The thymine-to-adenine 
change in IE12A destroys one of the two methylation sites of wild-type IE. Applicants 
used sPCR, a combinatorial, random directed mutagenesis technique to obtain modified 

20 transposase proteins that can restore transposition activity to polynucleotides flanked 
with IE12A. sPCR was developed and described Stemmer, W.P., "Rapid evolution of a 
protein in vitro by DNA shuffling," Nature 370:389-391 (1994) and Stemmer, W.P., 
"DNA shuffling by random fragmentation and reassembly: in vitro recombination for 
molecular evolution," Proc. Natl. Acad. Sci. U.S.A. 91:10747-10751 (1994), both of 

25 which are incorporated by reference herein as if set forth in their entirety. Briefly, DNA 
is manipulated in vitro in the sPCR method to introduce point mutations and to allow 
random recombination within a population of mutant sequences. The mutated genes can 
be cloned into plasmids which can be selected for increased activity in vivo. Clones 
having desirable phenotypes are then used as substrates for subsequent rounds of 

30 mutagenesis/recombination and selection for a further improved phenotype. 

The sPCR method can be used in conjunction with a screen (instead of a 
selection) in which a modest number of colonies (~10 4 ) are analyzed per round (Crameri, 
A., Whitehorn, E.A., and Stemmer, W.P., "Improved green fluorescent protein by 
molecular evolution using DNA shuffling." Nat. Biotechnol. 14:315-319 (1996), and 
35 Zhang, J.H., Dawes, G., and Stemmer, W.P., "Directed evolution of a fucosidase from a 
galactosidase by DNA shuffling and screening," Proc. Natl. Acad. Sci. U.S.A. 94, 
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4504-4509 (1997), both of which are incorporated by reference herein as if set forth in 
their entirety. A papulation assay described in the Example that follows was used as a 
screen for transposase mutants that restore transposition activity to polynucleotides 
flanked with IE12A. The papiilation assay is a modification of the assay described by 
5 Krebs, MP., and Reznikoff, W.S., "Use of a Tn5 derivative that creates lacZ 
translational fusions to obtain a transposition mutant," Gene 63:277-85 (1988), 
incorporated by reference in its entirety as if set forth herein. In the papiilation assay, 
productive transposition in frame into an actively expressed gene results in formation of 
a p-Gal fusion protein. These cells turn blue (due to presence of X-gal) and grow at an 
10 increased rate within the colony (lactose utilization). The rate at which these papillae 
form can be used to compare transposition rates promoted by mutated proteins. 

The applicants determined the ability of mutant transposases identified using the 
first method to catalyze in vivo transposition of polynucleotides flanked either with wild- 
type OE or with wild-type IE in a dam- strain (i.e., the nucleic acid was unmethylated). 
1 5 Among the mutants identified using the method, applicants identified a mutant 
transposase ("Tnp sC7") that retained near-wild-type activity with OE-flanked 
polynucleotides but which had very high activity with IE-flanked polynucleotides and 
even higher activity with IE-flanked polynucleotides when tested in a dam+ strain (i.e., 
the nucleic acid was methylated, subsequently "IE MEw ). Tnp sC7 contains seven 
20 mutations relative to wild-type transposase. 

In the second method, it was subsequently determined that a related mutant 
transposase ("Tnp sC7v2.0") having only four of the seven mutations of Tnp sC7 
exhibited a still higher IE^OE activity ratio. Both Tnp sC7 and Tnp sC7v2.0 contain a 
mutation that inhibits OE related activity (R8C), two mutations that specifically increase 
25 IE* 02 related activity (E58V, E344K), and a mutation that increases transposition of 
polynucleotides flanked by either IE** or OE (L372Q). 

Obtaining the mutant Tnp's 

The modified papiilation assay used to screen for productive transposition is 

shown and described in Fig. 2. In a first plasmid ("pRZ9904 (IE 1 2 A/IE 1 2 A)") used, a 
30 pair of inverted IE12A termini flank a polynucleotide that contains a lacZ gene but 

which lacks both a promoter and a translational start site. A second plasmid 

("pRZ9905") used in the assay, encodes for transposase that can move the lacZ- 

containing polynucleotide. Plasmids having these attributes can readily be constructed 

by a skilled artisan. Materials and methods are detailed below. 
35 Five random mutants that suppressed the end sequence mutation and yielded 

papillae after an initial cycle of mutagenesis/recombination. Equal amounts of plasmid 
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encoding each of these mutant transposases were then used as the initial substrates for a 
second round of mutagenesis/recombination. Following this second round, the mutated 
transposase genes were cloned into vector DNA and screened a second time for 
transposition activity with IE12A-defined polynucleotides via the papulation screen. 
5 From this second round, a total of 6 active mutants were isolated. A third round of 
mutagenesis/recombination was then performed followed by screening for activity with 
the mutant end sequence. This time hundreds of colonies were positive for papulation. 
Of these, 7 were clearly more active than the others and were isolated to serve as a 
template for a fourth round. In the fourth round of screening there were hundreds of 

10 transposing colonies visualized during the screening process. None of these, however, 
were as active as the most active mutant from the third round of 
mutagenesis/recombination (Tnp sC6). 

The most active isolate from each round was sequenced and tested for 
transposition activity with IE12A defined transposons in a quantitative papulation assay 

15 (see Figs. 3 A and 3B). Fig. 3 A depicts the mutations relative to wild-type Tnp of Tnp 
sA5 (best first round papillator), sB2 (best second round papillator), sC6 (best third 
round papillator), and sD5 (best fourth round papillator). Fig. 3B depicts the 
transposition activities of the four isolates in vivo in the papulation assay. Tnp WT was 
also tested but failed to promote a single detectable transposition event. The mutation 

20 Q81H is the only mutation that distinguishes the most active mutant, sC6, isolated in the 
third round, from the noticeably less active mutant sB2, isolated in the second round. A 
second isolate from the third round, Tnp sC7, is similar to the fourth round isolate sD5, 
except that it has two additional mutations (D217A and E344K). Tnp sC7's activity with 
IE12A defined transposons is similar to that of the fourth round isolate sD5 (data not 

25 shown). 

The mutant Tnp's can promote transposition of OE- and IE-defin ed transposons 

The mutant Tnp's described above were of initial interest because they could 

restore transposition activity to transposons flanked with IE12A termini that are inactive 

in the presence of wild-type Tnp. Although the transposase mutants functioned 
30 increasingly well with these transposons, the transposition rate did not approach the level 

of activity that is required in vitro. In fact, the in vivo activity of Tnp sC6 with IE12A 

ends only restored activity to a level similar to that of Tnp WT with transposons defined 

by OE (data not shown). 

Interestingly, however, most of the isolated mutants were hyperactive with 
35 transposons defined by at least one of the native end sequences (IE or OE). These 

transposition preferences were determined in a dam- (DNA unmethylated) strain using 
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the mating out assay of GoryshinJ.Y., Kil, Y.V., and Reznikoff, W.S., "DNA length, 
and twisting constraints on IS50 transposition," Proc Natl Acad Sci U S A 
91:10834-10838 (1994), incorporated by reference as if set forth herein in its entirety. 

Fig. 4 depicts the in vivo transposition activity of 25 mutant Tnps with IE and OE 
5 defined transposons in the mating-out assay in a dam- strain, normalized to the activity 
of wild-type Tnp (6.5X1 0" 5 normalized to 1). In this assay in a dam- environment, Tnp 
WT shows generally equivalent activity levels whether the substrate polynucleotide is 
flanked with IE or with OE. On the other hand, many of the mutants exhibited higher 
activity with IE than with OE. This is not surprising, since the mutants were obtained in 

10 a screen using IE12A, which differs from IE at only 1 nucleotide. In contrast, OE differs 
from IE12A at 6 nucleotides. One mutant in particular, Tnp sC7, displayed a very 
interesting phenotype. It is markedly hyperactive with IE transposons while exhibiting 
little change in the frequency of OE transposon movement. The ability to discriminate 
between IE and OE is important because it facilitates multi-part transpositions that 

15 separately employ IE and OE ends, where a reaction can be directed one way or another 
by providing a transposase that prefers either IE or OE. A preference for IE over OE of 
greater than about 5-fold may be suitable, though a preference of greater than about 10- 
fold for IE is more preferred. A preference of greater than about 20-fold is still more 
preferred. Fig. 4 demonstrates that a skilled artisan can obtain such mutant transposases 

20 using the methods disclosed herein. In particular, mutants sBl , sC6, sC7, sDl and sD3 
are examples of such mutants. 

Even more significantly, however, Tnp sC7 is not only not inhibited for 
transposition activity by methylated IE (which reduces Tnp WT levels by -102) but 
actually prefers IE^ transposons to those flanked by IE, as is shown in the mating out 

25 results of Table 1 which indicate that transposition frequency with OE is reduced in a 
dam+ strain for both Tnp sC7 and Tnp WT. Since binding of Tnp to OE is not affected 
by dam methylation, this difference merely reflects the difference in transposition 
activity between dam+ strains and dam- strains. Despite this reduction, the rate of IE 
defined transposition facilitated by Tnp sC7 is even higher in the dam+ strain due to the 

30 presence of methylation in the ends. In contrast, methylation of the end sequence 

inhibits recognition by wild-type transposase. On the basis of its ability to discriminate 
between IE and OE, and because of its insensitivity to methylation of IE, subsequent 
attention was directed to Tnp sC7. 
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Table 1. In vivo transposition rate of Tnp WT and Tnp sC7. 
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JCM101 (dam-) RZ212 (dam+) 

IEfreq. OEfreq. IE/OE IE^freq. OEfreq. IE^/OE 

Tnp WT 6.5X10- 5 6.5X10 5 1.0 1.0X10 8 3.1X10" 6 3.2X10° 

TnpsC7 1.8X10 3 3.7X10 5 50 2.6X10° 3.3X10* 6 794 



The role of the individual Tnp sC7 mutations in transposition activity 

To understand how individual mutations affect the activity of the protein and in 
an effort to maximize the activity of the protein and its ability to discriminate between 
IE 1 ^ and OE, the applicants strategically prepared more variant transposases using 
information obtained from sC7, which as noted above, contained seven mutations 
relative to wild type transposase. Since a comprehensive trial of all possible 
combinations of these seven mutations was cumbersome (128 possible combinations) 
two classes of seven mutants each were engineered. In each mutant of the first class, a 
unique mutation from sC7 was engineered to revert to wild-type at that position. This 
set of so-called "minus one" mutant transposases included all possible mutants having 6 
of the 7 mutations. In each mutant of the second class, wild-type transposase was 
engineered to contain exactly one of the 7 mutations from sC7. The set of so-called 
"plus one" mutant transposases included all possible mutants having only 1 of the 7 
mutations. These mutant proteins were all assayed for in vivo transposition activity with 
IE*^ and OE by mating out assays. The results of the analysis are shown in Table 2. 
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Table 2. Results of 'minus ones/plus ones' analysis. 
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a. 'Minus ones' contain all mutations present in Tnp sC7 except at indicated 

position, e.g. R8C 'minus one' contains the wild-type argenine at position 8. 
1 5 b. 'Plus ones' are Tnp WT except that the indicated amino acid is mutated to the 
residue present in Tnp sC7. E.g. R8C 'plus one' contains a cysteine at amino 
acid 8. 

Sequence-specific mutations . The mutation E58V has the most profound effect 
of all the mutations on the activity of Tnp sC7. This mutation in the wild-type 
20 background (E58V 'plus one') increases IE^ related transposition by 40,000 fold while 
removal of the mutation from Tnp sC7 (E58V 'minus one 1 ) drops the total activity by 
more than 1 ,000 fold. The mutation has comparatively little effect on OE related 
activity. 

The mutation E344K exhibits a similar, though much weaker, sequence specific 
25 effect on activity. When removed from Tnp sC7 (E344K 'minus one'), IE^ related 
activity decreases by 5 fold while OE related activity is stimulated about 5 fold. This 
result is mirrored in the 'plus ones' data as E344K in the wild type background stimulates 
IE^ related activity 4 fold and decreases OE related activity about 5 fold. 

Non-sequence-specific mutations . The mutation L372Q strongly stimulates Tnp 
30 sC7 activity with IE 14 *. When removed from sC7 both IE^ and OE related activity are 
reduced to the same degree. When added to wild-type transposase, its 'plus one' 
phenotype stimulates activity with both substrates. 

Other mutations. The mutation R8C was the most interesting of the remaining 
four mutations. When the mutation is added to wild-type transposase, OE related 
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10 



transposition was reduced nearly 10 fold. When the mutation was removed from sC7 OE 
related activity increased approximately 2 fold. In a methylating host, its removal from 
sC7 had little effect on IE 1 ^ related activity while by itself it decreased IE 1 * 8 related 
activity to below detectable levels. None of the remaining three (A157T, T171S, 
D217A) mutations has much effect on either IE 1 ^ or OE related activity when removed 
from sC7. To determine whether any combination of these mutations would lead to an 
increase in overall IE^ related activity without sacrificing its specificity for IE 1 ** over 
OE, the applicants engineered pairwise reversions of sC7 at these positions as well as a 
triple-reversion mutant. The construct with all three of these mutations reverted had the 
second highest IE** 8 related activity and the best ability to discriminate between IE* 02 and 
OE (Table 3). This four-mutant construct, Tnp (R8C, E58V, E344K, L372Q), was 
renamed Tnp sC7 v2.0. 



Table 3. In vivo frequency of Tnp sC7 with indicated mutations reverted to wild type. 
Numbers in parenthesis indicate positions changed to wild-type. 
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1.18X10 5 


3.6 


502 


TnpsC7(157,171,217)a 


7.4X1 0- 3 


2.8 


7.1X10" 6 


2.1 


1042 



a Re-named sC7v2.0. 



25 



Abbreviations: OE = outside end 
IE = inside end 

IE 1 * 35 = dam methylated inside end 



Tnp sC7 is less than the sum of its individual mutations. The Tnp E58V (E58V 
•plus one*) mutant has an activity increase of 4X1 0 4 over Tnp WT alone. The composite 
mutant with E58V removed (E58V f minus one') has an increase of 1.8X1 0 2 over Tnp 
WT. Additivity would then predict that Tnp sC7 would have an activity increase of: 



30 



(4X10 4 )(1.8X10 2 ) = 7.2X10 6 
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However the stimulation of the composite mutation is actually much lower (2X10 5 ). In 
other words the increase obtained by adding E58V to the other 6 mutations is 1 .1X10 3 
fold, and not the 4X1 0 4 fold stimulation seen by adding E58V into the wild-type 
background. This may be because that E58V and E344K are both stimulating the same 
step (primary DNA binding) and are less than additive when combined. 

Tnp sC7v2.0 effectively transposes IE*** defined transposons in vitro 

The ability of Tnp sC7v2.0 to promote transposition of IE^ defined transposons 
in vitro was tested under the same conditions developed for movement of OE defined 
transposons by Tnp EK/LP (Goryshin, I.Y., and Reznikoff, W.S., "Tn5 in vitro 
transposition," J. Biol. Chem. 273:7367-7374 (1998), incorporated by reference as if set 
forth herein in its entirety). Substrate plasmid pGT4, a high-copy number pUC19-based 
vector in which inverted IE 1 ** end sequences flank a kanamycin resistance gene, was 
purified as a supercoiled monomer (see Materials and Methods). This plasmid was 
constructed so that digestion with PvuII restriction endonuclease causes release of the 
transposon from the donor backbone DNA (see Fig. 5A). Tnp sC7v2.0 was produced 
and purified by cloning the nucleotide sequence in an expression vector, expressing the 
protein in a host cell and isolating the protein from an extract from the host cell, all using 
standard methods known to a skilled artisan. 

Incubation with Tnp sC7v2.0 results in conversion of 66% of supercoiled 
plasmid into transposition products and intermediates after a three hour incubation. In 
Fig. 5B, lane 1 is unreacted substrate pGT4. The transposition activity promoted by Tnp 
sC7v2.0 on lE 1 ^ transposons is shown in lane 2 Lane 3 is the result of pGT4 digestion 
with PvuII restriction endonuclease. 

Though transposition reactions performed in this fashion can lead to many 
different DNA products, the reaction can be interpreted by analyzing certain diagnostic 
fragments as defined previously by Goryshin and Reznikoff (1998), supra. Fig. 5C is a 
reproduction of lane 2 of Fig. 5B. Band 1 is the excised transposon. It is an intermediate 
that has undergone double ended break from the plasmid but has not undergone strand 
transfer. Band 2 is the donor backbone DNA that is released upon double-ended 
excision of the transposon. These bands have the same molecular weight as the products 
of the PvuII digest shown in lane 3 of Fig. 5B. Band 3 represents substrate plasmid that 
has undergone cleavage at one transposon end. It migrates at the same position as 
linearized plasmid (data not shown). Bands 4 and 5 are two different types of strand 
transfer products. Band 4 is the result of a transposon inserting intermolecularly into an 
unreacted plasmid. This results in a relaxed circular DNA that is longer than the original 
substrate plasmid by the length of the inserted transposon. The bands denoted as 5 are 
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the result of intramolecular inversion events. These transposition products are the size of 
the transposon but circularized. These circularized transposition products can contain 
differing numbers of nodes and hence migrate in different positions on the gel. 

Structure/function analysis 
5 The hyperactive Tn5 transposase mutant Tnp sC7v2.0, which increases 

transposition of IE 1 ^ flanked transposons by 7.4X1 0 5 times, includes mutations at amino 
acids 8, 58, 344, and 372. Analysis of Tnp sC7 revealed that the hyperactivity conferred 
by both the E58V and E344K mutations depends upon the transposon termini sequence. 
This phenotype occurring for E58V was not surprising, as previous random mutagenesis 

10 screens performed with OE flanked transposons resulted in the isolation of proteins with 
mutations at either amino acid 47 or amino acid 54 which had sequence specific activity 
(Zhou and Reznikoff, 1997; Zhou,M., BhasiaA, and Reznikoff,W.S., "Molecular 
genetic analysis of transposase-end DNA sequence recognition: cooperativity of three 
adjacent base-pairs in specific interaction with a mutant Tn5 transposase," J. Mol. Biol. 

15 276:913-925 (1997)). The sequence specific activity of a mutation at amino acid 344 
that suggests that the residue also interacts with transposon end DNA, however, was 
indeed very surprising. 

Recently a protein-DNA co-crystal representing Tnp EK/LP complexed with 
pre-cut (no donor backbone) OE DNA has been solved. In this complex amino acid 58 

20 is shown to interact specifically with OE at position 10. This places the mutant residue 
in the vicinity of position 12, the nucleotide mutation that it was initially chosen to 
repress. Positions 10, 1 1, and 12 are all different between IE and OE and furthermore it 
is near one of two areas that contain major groove modification by dam methylase 
(position 1 1 of top strand and position 12 of bottom strand). 

25 The structure further shows that amino acid 344 interacts with DNA at position 7 

of OE. While position 7 is not one of the seven that distinguishes IE from OE, it is 
between both position 4 and the region of nucleotides 10, 1 1, and 12 that differentiate the 
two. It is therefore plausible that the activity difference of Tnp E344K with IE and OE 
can be attributed to context effects of these two regions. We therefore propose that both 

30 mutations E58V and E344K are interacting with end sequence DNA and alter 
transposase function at the level of primary sequence recognition. 

The leucine to glutamine mutation (L372Q), which was present in all sequenced 
mutants (sA5, sB2, sC6, sC7, and sD5), was both familiar and surprising. It was familiar 
because a hyperactive mutation at this position was previously isolated. It was 

35 surprising because the previously isolated mutation causes a leucine to proline (L372P) 
substitution at this position (Weinreich et al., 1994). The L372P mutation results in two 
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consecutive proline residues in an a helix. In the co-crystal this destabilizes amino acids 
372-390. It is proposed that the mutation changes the conformation of the catalytic 
domain in relation to the C-terminal dhnerization domain. This alteration can improve 
activity by increasing the distance between the N and C termini. The close proximity of 
these termini in Tnp WT is thought to decrease the rate of transposition of Tnp WT 
(Reznikoff et al., 1999). The wild-type leucine residue, visible in an earlier partial Tnp 
structure (Davies,D.R., Braam,L.M., Reznikoff, W.S. and Rayment,L, "The 
three-dimensional structure of a Tn5 transposase-related protein determined to 2.9-A 
resolution," J Biol Chem. 274: 1 1904-1 1913 (1999)), is buried in a hydrophobic pocket. 
It is likely that substitution with a glutamine destabilizes this hydrophobic packaging. 
Given that two different hyperactive mutations have been isolated by PCR-base random 
mutagenesis techniques that only result in single nucleotide changes in the codon and 
can only make five different amino acid substitutions at this residue (methionine, valine, 
arginine, proline, or glutamine), it is likely that a more direct mutagenesis approach such 
as codon randomization could reveal other interesting mutants. 

The mutation R8C, which reduces OE related transposition almost 10-fold in the 
wild-type background (R8C 'plus one') and increases it approximately 2-fold when 
removed from the sC7 background (R8C 'minus one') is less easy to interpret In the 
co-cry stallographic structure the arginine residue is not in an area of DNA contact. 
However this structure only represents a still picture of the complex after cleavage has 
occurred and it is possible that this region contacts DNA in the initial synapse. 

IE methvlation 

The inside end of the Tn5 transposon contains two GATC signal sequences that 
add four methyl groups into the major groove of each end. In this study, we were able to 
isolate a single mutation, E58V, which not only overcomes this binding inhibition but 
also appears to preferentially function on transposons in which the methyl groups are 
present (presumably due to increased binding affinity). Furthermore, the co-crystal 
structure of Tnp EK/LP complexed with pre-cleaved DNA shows that glutamate 58 
interacts directly with position 10 of OE in the major groove. This region is in the 
vicinity of the methyl group that is present on the adenine of the non-transferred strand 
of IE 1 **. The fact that a single amino acid change can result in this extreme change in 
phenotype leads us to propose that this methyl group alone is responsible for the 
inhibition of binding of transposase to IE"* The inhibition of binding by Tnp WT to 
IE 1 ** is likely caused by an interaction between this methyl group and the negatively 
charged side chain of glutamate 58. Replacing this residue with a valine can not only 
remove this unfavorable interaction but also lead to an increase in binding affinity due to 



-15- 



WO 01/09363 PCT/USOO/21052 

hydrophobic packaging between the side chain of the valine residue and this methyl 
group. 



General Materials and Methods 

Media and Reagents 

Papulation assays were performed using glucose minimal Miller media (Miller, 
j. 5 Experiments in Molecular Genetics. Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY. (1992)) supplemented with ampicillin, chloramphenicol, 
5-bromo-4-chloro-3-indolyl-p- D-galactoside, and phenyl-P-D-galactoside (Trp--XG-PG 
plates) as described previously by Zhou, M., & Reznikoff, W.S., supra. After 
transformation during site directed mutagenesis, the cells were outgrown in SOC media 
as indicated in the altered sites protocol (Promega). All other bacterial growth was 
performed in Luria Broth (Sambrook, J., Fritsch, E.F. & Maniatis, T. (1989). Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY.). When necessary, antibiotics (Sigma) were added at the following 
concentrations: ampicillin 100|ig/ml; chloramphenicol 20ng/ml; nalidixic acid 20^ig/ml; 
gentamycin 5ng/ml; tetracycline 15^g/ml. Taq DNA Polymerase, T4 polymerase, T4 
ligase, dNTFs, and all components of the Altered Sites Mutagenesis kit were purchased 
from Promega. Restriction enzymes were purchased either from Promega or New 
England Biolabs. Oligonucleotides used in site directed mutagenesis, sPCR, and 
sequencing were purchased from Research Genetics. Radionucleotides used in 
sequencing were from Amersham. 

Construction of plasmids 

Plasmid pGT4 was constructed as a high copy number plasmid containing a 
kanamycin resistance gene flanked by two inside ends. It is designed so that digestion 
with PvuII releases the transposon from its pUC vector backbone. 

The fourteen pRZ9905 (sC7) derivatives and pRZ9905 (sC7 version 2.0) were 
constructed by swapping restriction fragments between pRZ9905 and pRZ9905 (sC7). 

Bacterial Strains 

Cloning of plasmids and the directed evolution process were performed in JM109 
(Promega). The site-directed mutagenesis protocol utilized strain ESI 301 (Promega) 
and JM109. In the mating out assay, transposase plasmids were transformed into strain 
RZ212 [A(lac-proA,B), ara, str, recA56, srl, thi/pOX38-Gen], followed by conjugation 
into 14R525[F-nalr], 
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Directed Evolution Process 

Directed evolution of Tn5 transposase was carried out by sPCR, basically as 
described previously by Stemmer (1994). Forty micro liters (— 4^ig) of a pRZ9905 
plasmid mini prep (Wizard S V preps, Promega) were partially digested in a SOjil volume 
5 containing lOOmM Tris-HCl (ph = 7.0), 5nM MgCl, and 90ng of DNase I. After a 

7-minute incubation at room temperature, the reactions were stopped by adding EDTA to 
lOmM. Following the addition of loading dye (Sambrook et. al, 1989) the digested 
DNA was electrophoresed in a 2% NuSieve gel (FMC BioProducts) next to pGEM DNA 
markers (Promega). A gel slice containing DNA fragments of 200-600bp in size as well 

10 as a second slice containing DNA fragments from 600-1000bp in size were excised from 
the gel. The DNA from these two slices was purified separately by phenol chloroform 
extraction (Sambrook et. al, 1989). After ethanol precipitation, the DNA pellets were 
dried and resuspended directly into 50 \xl of an assembly reaction mix. The assembly 
mix contained 0.2mM dNTP's; 2.0mM MgCl; 50mM KC1; lOmM Tris-HCl (pH9.0 at 

1 5 25C); and 0. 1% Triton X-100. After addition of 0.5 units of Taq DNA Polymerase, the 
DNA was reassembled by the following thermo cycling program: 94C for 30seconds; 50 
cycles of 94C for 20 seconds, 65C for 1 minute, and 72C for 2 minutes; and cooling to 
4C. A standard PCR amplification reaction using 5jjl1 of the assembly reaction product 
as a DNA template was performed for each sample (200-600bp and 600-1000bp) to 

20 amplify the transposase gene. This transposase-encoding fragment was digested to 
completion with Aflll and Bglll and ligated into purified Aflll/BgUI digested vector 
DNA from pRZ9905. 

Ligation products were transformed into electrocompetent JM109 cells that 
contained plasmid pRZ9904 (IE12A/IE12A). After outgrowth the cells were plated on 

25 Trp— XG-PG plates with chloramphenicol and ampicillin selection. The plates were 

incubated at 32C for 14 days. At this time pRZ9905 plasmid DNA from all colonies that 
exhibited at least one papillae were isolated and re-transformed into the papulation assay 
to confirm their papulation plus phenotype. A total of 5 pRZ9905 derivatives (out of 
20,000 original colonies screened) were confirmed to be papillation plus. An equal 

30 amount of all five plasmids was then used as the substrate for a second round of 
mutagenesis and screening. This process was repeated for a total of four rounds of 
screening (~ 20,000 colonies / round) 

Quantitative Papillation Assay 

The IE12A in vivo transposition activity of Tnp WT, Tnp sA5, Tnp sB2, Tnp 
35 sC6, and Tnp sD5 were compared by a quantitative papillation assay. Competent ceUs 
of strain JM109 harboring plasmid pRZ9904 (IE12A/IE12A) were transformed with the 
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appropriate transposase-encoding version of pRZ9905. After outgrowth, transformed 
cells were plated on Trp— XG-PG plates with chloramphenicol and ampicillin. The 
plates were grown at 32C until colonies began to appear (-18 hours). Individual 
colonies were then picked with sterile sticks and spotted onto a fresh plate in a 4X4 grid 
pattern to evenly space all colonies. One plate of 16 colonies was spotted for each 
protein. Plates were incubated at 32C and quantified for transposition by observing the 
appearance of papillae at 24-hour intervals. Data are expressed as the average number of 
papillae present per colony. 

Mating out assays 

Mating out assays were performed as described previously (Yin et al., 1988; 
Goryshin et al., 1994). Bacterial cells with the transposon containing plasmids 
pFMA52-187 (with either two OEs or two IEs) and the F factor pox-Gen were 
transformed with the appropriate transposase encoding plasmid pRZ9905. The donor 
used for the library screening was the strain JCM101 [AlacZX74, raps, dam-3]. All other 
mating out was performed in E. coli strain RZ212 [A(lac-proA,B), ara, str, recA56, srl, 
thi]. The recipient strain used was 14R525[F-nalr]. A total of three assays were 
performed for each combination of transposase and end sequence. The values reported 
are the average of these three data points. 

In vitro transposition assays 

Substrate plasmid pGT4 was isolated from DH5a cells using a qiafilter plasmid 
mega kit (Qiagen). Supercoiled monomer plasmid was isolated from a 1% agarose gel 
by use of the qiaquick gel purification kit (Qiagen). Reactions were performed at 37C 
under conditions determined by Reznikoff and Goryshin (1998). The concentration of 
pGT4 was 35.5nM. Tnp sC7v2.0 was added to a concentration of 280nM. 

The foregoing is not intended to limit the scope of the invention. Rather the 
invention is understood to encompass all the variations and modifications that come 
within the scope of the appended claims. 
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1 . A mutant Tn5 transposase protein modified relative to wild-type Tn5 
transposase protein, the mutant transposase differing from the wild-type protein at a 

5 position selected from the group consisting of amino acid 58 and amino acid 372, 
wherein the mutant Tn5 transposase protein has a preference for Tn5 inside ends over 
Tn5 outside ends. 

2. A mutant Tn5 transposase as claimed in Claim 1 comprising a valine at amino 
acid position 58. 

10 3. A mutant Tn5 transposase as claimed in Claim 1 comprising a mutation at 

amino acid position 372. 

4. A mutant Tn5 transposase as claimed in Claim 1 comprising a glutamine at 
amino acid position 372. 

5. A mutant Tn5 transposase as claimed in Claim 1, wherein the mutant 
15 transposase differs from the wild-type protein at amino acids 58 and 372. 

6. A mutant Tn5 transposase as claimed in Claim 1 comprising a valine at amino 
acid position 58 and a glutamine at amino acid position 372. 

7. A mutant Tn5 transposase as claimed in Claim 1 further differing from wild- 
type Tn5 transposase at a position selected from amino acid 8 and amino acid 344. 

20 8. A mutant Tn5 transposase as claimed in Claim 7 wherein the amino acid at 

position 8 is cysteine. 
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9. A mutant Tn5 transposase as claimed in Claim 7 wherein the amino acid at 
position 344 is lysine. 



1 0. A mutant Tn5 transposase as claimed in Claim 7 wherein the mutant Tn5 
transposase differs from wild-type Tn5 transposase at amino acid position 8 and amino 

5 acid position 344. 

1 1 . A mutant Tn5 transposase as claimed in Claim 1 comprising a cysteine at 
amino acid position 8 and a lysine at amino acid position 344. 

12. A mutant Tn5 transposase as claimed in Claim 1 comprising a cysteine at 
amino acid position 8, a valine at amino acid position 58, a lysine at amino acid position 

10 344, and a glutamine at amino acid position 372. 

13. A polynucleotide that encodes a transposase as claimed in Claim 1 . 

14. A polynucleotide that encodes a transposase as claimed in Claim 12. 

15. A system for transposing a transposable DNA sequence in vitro, the system 
comprising: 

15 the mutant Tni transposase modified relative to a wild-type Tn5 transposase, as 

claimed in Claim 1 ; 

a donor DNA molecule comprising the transposable DNA sequence, the 
sequence being flanked at its 5'- and 3 f ends by a wild-type IE sequence; and 

a target DNA molecule into which the transposable element can transpose. 

20 16. A system as claimed in Claim 15 wherein the IE sequence is methylated. 
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17. A method for in vitro transposition, the method comprising the steps of: 
combining a donor DNA molecule that comprises a transposable DNA sequence 
of interest, the DNA sequence of interest being flanked at its 5- and 3 - ends by a wild- 
type Tn5 IE sequence, with a target DNA molecule and a mutant Tni transposase as 
claimed in Claim 1 in a suitable reaction buffer at a temperature below a physiological 
temperature until the modified transposase binds to the IE sequences; and 

raising the temperature to a physiological temperature for a period of time 
sufficient for the enzyme to catalyze in vitro transposition. 
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SEQUENCE LISTING 

<110> Reznikoff , William S 
Naumann, Todd A 

<120> Transposase Enzyme and Method for Use 

5 <130> 960296.96471 

<140> 
<141> 

<150> 60/146686 

<151> 1999-08-02 ^'' 

10 <160> 5 



<170> Patentln Ver. 2.1 



<210> 1 
<211> 1431 
<212> DNA 
15 <213> Transposon Tn5 

<220> 

<221> CDS 

<222> (1) . . (1428) 



<400> 1 

20 atg ata act tct get ctt cat cgt 
Met lie Thr Ser Ala Leu His Arg 
1 5 

ttc tct teg gcg gcg ctg ggt gat 
Phe Ser Ser Ala Ala Leu Gly Asp 
25 20 



gcg gec gac tgg get aaa tct gtg 48 
Ala Ala Asp Trp Ala Lys Ser Val 
10 15 

cct cgc cgt act gee cgc ttg gtt 96 
Pro Arg Arg Thr Ala Arg Leu Val 
25 30 



aac gtc gee gee caa ttg gca aaa tat tct ggt aaa tea ata acc ate 
Asn Val Ala Ala Gin Leu Ala Lys Tyr Ser Gly Lys Ser He Thr He 
35 40 45 

tea tea gag ggt agt gaa gee atg cag gaa ggc get tac cga ttt ate 
30 Ser Ser Glu Gly Ser Glu Ala Met Gin Glu Gly Ala Tyr Arg Phe He 
50 55 60 



144 



192 
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cgc aat ccc aac gtt tct gcc gag gcg ate aga aag get ggc gec atg 24 0 
Arg Asn Pro Asn Val Ser Ala Glu Ala lie Arg Lys Ala Gly Ala Met 
65 70 75 80 

caa aca gtc aag ttg get cag gag ttt ccc gaa ctg ctg gcc att gag 288 
5 Gin Thr Val Lys Leu Ala Gin Glu Phe Pro Glu Leu Leu Ala lie Glu 

85 90 95 

gac acc acc tct ttg agt tat cgc cac cag gtc gcc gaa gag ctt ggc 336 
Asp Thr Thr Ser Leu Ser Tyr Arg His Gin Val Ala Glu Glu Leu Gly 
100 105 110 

10 aag ctg ggc tct: att cag gat aaa tec cgc gga tgg tgg gtt cac tec 384 
Lys Leu Gly Ser He Gin Asp Lys Ser Arg Gly Trp Trp Val His Ser 
115 120 125 

gtt etc ttg etc gag gcc acc aca ttc cgc acc gta gga tta ctg cat 432 
Val Leu Leu Leu Glu Ala Thr Thr Phe Arg Thr Val Gly Leu Leu His 
15 130 135 140 

cag gag tgg tgg atg cgc ccg gat gac cct gcc gat gcg gat gaa aag 480 
Gin Glu Trp Trp Met Arg Pro Asp Asp Pro Ala Asp Ala Asp Glu Lys 
145 150 155 160 

gag agt ggc aaa tgg ctg gca gcg gcc gca act age egg tta cgc atg 528 
20 Glu Ser Gly Lys Trp Leu Ala Ala Ala Ala Thr Ser Arg Leu Arg Met 

165 170 175 

ggc age atg atg age aac gtg att gcg gtc tgt gac cgc gaa gcc gat 576 
Gly Ser Met Met Ser Asn Val He Ala Val Cys Asp Arg Glu Ala Asp 
180 185 190 

25 att cat get tat ctg cag gac aaa ctg gcg cat aac gag cgc ttc gtg 624 
He His Ala Tyr Leu Gin Asp Lys Leu Ala His Asn Glu Arg Phe Val 
195 200 205 

gtg cgc tec aag cac cca cgc aag gac gta gag tct ggg ttg tat ctg 6 72 
Val Arg Ser Lys His Pro Arg Lys Asp Val Glu Ser Gly Leu Tyr Leu 
30 210 215 220 



tac gac cat ctg aag aac caa ccg gag ttg ggt ggc tat cag ate age 
Tyr Asp His Leu Lys Asn Gin Pro Glu Leu Gly Gly Tyr Gin He Ser 
225 230 235 240 
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25 age ttc acg ctg ccg caa gca etc agg gcg caa ggg ctg eta aag gaa 
Ser Phe Thr Leu Pro Gin Ala Leu Arg Ala Gin Gly Leu Leu Lys Glu 
370 375 380 

gcg gaa cac gta gaa age cag tec gca gaa acg gtg ctg ace ccg gat 
Ala Glu His Val Glu Ser Gin Ser Ala Glu Thr Val Leu Thr Pro Asp 
30 385 390 395 400 

gaa tgt cag eta ctg ggc tat ctg gac aag gga aaa cgc aag cgc aaa 
Glu Cys Gin Leu Leu Gly Tyr Leu Asp Lys Gly Lys Arg Lys Arg Lys 
405 410 415 



att ccg caa aag ggc gtg gtg gat aaa cgc ggt aaa cgt aaa aat cga 768 
He Pro Gin Lys Gly Val Val Asp Lys Arg Gly Lys Arg Lys Asn Arg 
245 250 255 

cca gee cgc aag gcg age ttg age ctg cgc agt ggg cgc ate acg eta 816 
5 Pro Ala Arg Lys Ala Ser Leu Ser Leu Arg Ser Gly Arg He Thr Leu 
260 265 270 

aaa cag ggg aat ate acg etc aac gcg gtg ctg gec gag gag att aac 864 
Lys Gin Gly Asn He Thr Leu Asn Ala Val Leu Ala Glu Glu He Asn 
275 280 285 

..,,.„ ,„ ■.., * f . ' ' . * . 

10 ccg ccc aag ggt gag acc ccg ttg aaa tgg ttg ttg ctg ace age gaa 912 
Pro Pro Lys Gly Glu Thr Pro Leu Lys Trp Leu Leu Leu Thr Ser Glu 
290 295 300 

ccg gtc gag teg eta gee caa gee ttg cgc gtc ate gac att tat acc 960 
Pro Val Glu Ser Leu Ala Gin Ala Leu Arg Val He Asp He Tyr Thr 
15 305 310 315 320 

cat cgc tgg egg ate gag gag ttc cat aag gca tgg aaa acc gga gca 
His Arg Trp Arg He Glu Glu Phe His Lys Ala Trp Lys Thr Gly Ala 
325 330 335 

gga gee gag agg caa cgc atg gag gag ccg gat aat ctg gag egg atg 1056 
20 Gly Ala Glu Arg Gin Arg Met Glu Glu Pro Asp Asn Leu Glu Arg Met 
340 345 350 

gtc teg ate etc teg ttt gtt gcg gtc agg ctg tta cag etc aga gaa 1104 
Val Ser lie. Leu Ser Phe Val Ala Val Arg Leu Leu Gin Leu Arg Glu 
355 360 365 
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gag aaa gca ggt age ttg cag tgg get tac atg gcg ata get aga ctg 1296 
Glu Lys Ala Gly Ser Leu Gin Trp Ala Tyr Met Ala He Ala Arg Leu 
420 425 430 

ggc ggt ttt atg gac age aag cga acc gga att gcc age tgg ggc gcc 1344 
5 Gly Gly Phe Met Asp Ser Lys Arg Thr Gly He Ala Ser Trp Gly Ala 
435 440 445 

etc tgg gaa ggt tgg gaa gcc ctg caa agt aaa ctg gat ggc ttt ctt 13 92 
Leu Trp Glu Gly Trp Glu Ala Leu Gin Ser Lys Leu Asp Gly Phe Leu 
450 455 460 

10 gcc gcc aag gat ctg atg gcg cag ggg ate aag ate tga 1431 
Ala Ala Lys Asp Leu Met Ala Gin Gly He Lys He 
465 470 475 



<210> 2 
<211> 476 
15 <212> PRT 

<213> Transposon Tn5 

<400> 2 

Met He Thr Ser Ala Leu His Arg Ala Ala Asp Trp Ala Lys Ser Val 
! 5 10 15 

20 Phe Ser Ser Ala Ala Leu Gly Asp Pro Arg Arg Thr Ala Arg Leu Val 

20 25 30 

Asn Val Ala Ala Gin Leu Ala Lys Tyr Ser Gly Lys Ser He Thr He 
35 40 45 

Ser Ser Glu Gly Ser Glu Ala Met Gin Glu Gly Ala Tyr Arg Phe He 
25 50 55 60 

Arg Asn Pro Asn Val Ser Ala Glu Ala He Arg Lys Ala Gly Ala Met 
65 70 75 80 

Gin Thr Val Lys Leu Ala Gin Glu Phe Pro Glu Leu Leu Ala He Glu 
85 90 95 

30 Asp Thr Thr Ser Leu Ser Tyr Arg His Gin Val Ala Glu Glu Leu Gly 
, 100 105 110 
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Lys Leu Gly Ser He Gin Asp Lys Ser Arg Gly Trp Trp Val His Ser 
115 120 125 

Val Leu Leu Leu Glu Ala Thr Thr Phe Arg Thr Val Gly Leu Leu His 
130 135 140 

5 Gin Glu Trp Trp Met Arg Pro Asp Asp Pro Ala Asp Ala Asp Glu Lys 
145 150 155 160 

Glu Ser Gly Lys Trp Leu Ala Ala Ala Ala Thr Ser Arg Leu Arg Met 
165 170 175 

Gly Ser Met Met Ser Asn Val He Ala Val Cys Asp Arg Glu Ala Asp 
10 180 185 190 

He His Ala Tyr Leu Gin Asp Lys Leu Ala His Asn Glu Arg Phe Val 
195 200 205 

Val Arg Ser Lys His Pro Arg Lys Asp Val Glu Ser Gly Leu Tyr Leu 
210 215 220 

15 Tyr Asp His Leu Lys Asn Gin Pro Glu Leu Gly Gly Tyr Gin He Ser 
225 230 235 240 

He Pro Gin Lys Gly Val Val Asp Lys Arg Gly Lys Arg Lys Asn Arg 
245 250 255 

Pro Ala Arg Lys Ala Ser Leu Ser Leu Arg Ser Gly Arg He Thr Leu 
20 260 265 270 

Lys Gin Gly Asn He Thr Leu Asn Ala Val Leu Ala Glu Glu He Asn 
275 280 285 

Pro Pro Lys Gly Glu Thr Pro Leu Lys Trp Leu Leu Leu Thr Ser Glu 
290 295 300 

25 Pro Val Glu Ser Leu Ala Gin Ala Leu Arg Val He Asp He Tyr Thr 
305 310 315 320 

His Arg Trp Arg He Glu Glu Phe His Lys Ala Trp Lys Thr Gly Ala 
325 330 335 

Gly Ala Glu Arg Gin Arg Met Glu Glu Pro Asp Asn Leu Glu Arg Met 
30 340 345 350 
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Val Ser lie Leu Ser Phe Val Ala Val Arg Leu Leu Gin Leu Arg Glu 
355 360 365 

Ser Phe Thr Leu Pro Gin Ala Leu Arg Ala Gin Gly Leu Leu Lys Glu 
370 375 380 

5 Ala Glu His Val Glu Ser Gin Ser Ala Glu Thr Val Leu Thr Pro Asp 
385 390 395 400 

Glu Cys Gin Leu Leu Gly Tyr Leu Asp Lys Gly Lys Arg Lys Arg Lys 
405 410 415 

Glu Lys Ala Gly Ser Leu Gin Trp Ala Tyr Met Ala lie Ala Arg Leu 
10 420 425 430 

Gly Gly Phe Met Asp Ser Lys Arg Thr Gly He Ala Ser Trp Gly Ala 
435 440 445 

Leu Trp Glu Gly Trp Glu Ala Leu Gin Ser Lys Leu Asp Gly Phe Leu 
450 455 460 

15 Ala Ala Lys Asp Leu Met Ala Gin Gly He Lys He 
465 470 475 



<210> 3 

<211> 19 

<212> DNA 

20 <213> Transposon Tn5 

<400> 3 

ctgactctta tacacaagt 



<210> 4 
<211> 19 
25 <212> DNA 

<213> Transposon Tn5 

<400> 4 

ctgtctcttg atcagatct 
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<210> 5 
<211> 19 
<212> DNA 

<213> Transposon Tn5 



<400> 5 

ctgtctcttg aacagatct 
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